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Statistical Naturalness and non-Gaussianity in a Finite Universe 
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We examine tiie behavior of n-point functions of the primordial curvature perturbations assuming 
our observed universe is only a subset of a larger space with statistically homogeneous and isotropic 
perturbations. We show that if the larger space has arbitrary correlation functions in a large 
family of local type non-Gaussian statistics, sufficiently biased smaller volumes will have statistics 
from a 'natural' version of that family with moments that are weakly non-Gaussian and ordered. 
Depending on the total size of the universe and the scale-dependence of the power spectrum, typical 
subsamples the size of our observed volume may be sufficiently biased to make weak non-Gaussianity 
whose dominant term is consistent with the usual local ansatz very likely, regardless of the statistics 
of the original field. We also argue that although the dominant shape of the momentum-space 
correlation functions may not be identical in different volumes, the characteristic behavior of the 
squeezed limit of the bispectrum is independent of the bias of the subsample. 

PACS numbers: 98.80.Bp 
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Measurements of the primordial density fluctuations 
are the primary tool to test the paradigm of inflation- 
ary cosmology and to distinguish between the many pro- 
posed particle physics scenarios for inflation. As our 
ability to test the statistics beyond the power spectrum, 
collectively called non-Gaussianity, becomes more ad- 
vanced, new questions arise: what is the best way to 
test non-Gaussianity? What measurements would point 
definitively to particular models of inflation? So far the 
proposed approaches to address these questions rely ei- 
ther on a particle physics notion of naturalness for non- 
Gaussianity, either for the inflaton fleld [ij or for the fluc- 
tuations [2| , or on mode expansions to try to capture any 
non-Gaussianity that is observationally accessible Q . 

Here we point out a distinct and complementary way 
of thinking about naturalness. We suppose only that the 
universe is considerably larger than what we see (which is 
the natural outcome in many inflation models) and that 
there exists a homogeneous and isotropic spectrum of pri- 
mordial fluctuations in the gravitational field on all scales 
in the entire volume. Then, for a given choice of statis- 
tics in the large volume, we can ask what statistics are 
typical to spatial subsets of the size of our universe. In 
this report, we will find a notion of statistical naturalness 
for typical small volumes where a family of well-behaved 
correlation functions is generated from a parent volume 
with arbitrarily fine-tuned statistics in the same family. 
Our results build on previous work on the non-Gaussian 
halo bias from the standard local ansatz ^ and in gi^^ 
type non-Gaussianity [1], and can be used to more pre- 
cisely characterize observable features of multi-field in- 
fiation models @. This work is an extension of ideas in 
01 and is similar in spirit to recent results in (8l-[lo|. We 
illustrate this point with a simple first example, showing 
that the local ansatz for non-Gaussianity, with an am- 
plitude that is weakly non-Gaussian and whose principle 
term is quadratic in the underlying Gaussian, is statisti- 



cally natural. 

Consider a large volume characterized by side length or 
radius L and a smaller volume characterized by scale M . 
Here we will generally have in mind that M is the scale of 
our currently observably universe and L the scale of the 
entire universe (which we assume to be finite) . Note that 
L may also just be the largest scale on which this pre- 
scription for the fluctuations is trusted. It is also some- 
times useful to consider L to be the size of our observable 
universe and M the scale of an N-body simulation or of 
some local region whose Large Scale Structure we are 
interested in. We define the curvature perturbation in 
each region as the fractional shift to the scale factor a 
describing a background, homogeneous FRW universe: 



a{x) = aL(l -I- C,{x)) , 
= aM(l + CW) 
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where |C|, |C| < 1 by definition. We define a maximum 
wavenumber /cmax from the smallest scale we smooth over 
in defining the fluctuations. The average over a subsam- 
ple volume M of the perturbations defined with respect 
to the volume L is (C) m ■ These quantities are related by 

1 + C(x) = (i + (C)M)(i + C(a;)) , xeVolM- (3) 

The power spectrum in either volume is defined in terms 
of the two-point function, (CkiCk2) = (27r)^(5'^(ki -|- 
k2)P(A:), and the dimensionless power spectrum is 
V{k) ^ ^P{k). 

Local Ansatz. Consider a simple form of the local 
ansatz where the curvature perturbation, C,{x) is a lo- 
cal non-linear function of a Gaussian field C,g{x)- In the 
large volume, suppose the curvature perturbation is 



ax) = NiCG{x) 



|^2CG(a:)' + j.Ni^Gixf + (4) 
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where we implicitly shift the field so the mean (C) 
throughout the large volume is zero. The original lo- 
cal ansatz [llj set Ni — 1 and N2 — ^Inl to define the 
non-linearity parameter /nl- We will take the Ni to be 
constants. The real-space variance of the Gaussian field 
Cg is 



L-1 



(2^) 



(5) 



where Pq is the power spectrum of Cg- It will also be 
useful to define (Tq^ and CTqs, with limits of integration 
changed to {L^^ , M^-^) and (Af~"^, fcmax), respectively. 

Consider ((x) for x d M, a subsample. Dividing 
Ccix S M) into long and short-wavelength parts gives 
Cm = Ci,M + (s,M- Here (i^m is the real space field 
smoothed over region Af and is similar to (C) m defined 
above, up to the difference between the real space and 
Fourier space top- hat window functions. Following 
this gives the local background, a constant in any partic- 
ular subsample M, 



(6) 



where B = C,gi,m/^o is a measure of bias in a given 
subsample M . Similarly, 

C.(x) = N,Qgs{x) + ^iV2C^,(x) -t- j^ Mlsi^) + - (7) 

contains the short-wavelength fiuctuations. However, the 
coefficients iV„ now depend on the local background: 

N„{B) ^N.n+ Nn+iaoB + ^Nn+2alB^ + - (8) 
Then the curvature perturbation in any small volume, 
C(x e M) = Cg{x) + ^iV2C^(x) + i7V3CS(^) + (9) 

is related to C by C = Cs/(1 + Cz.m), which follows from 
and the long- and short-wavelength split above; here 
we have set A^i = 1. Reading off the Gaussian part of 
each expression, we have 



Cg 



1 + 0. 



-Qgs, 



(10) 
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and consequently, the contribution Vg to the power spec- 
trum from the Gaussian part Cg is different in the small 

volume, Vg = \-r^ — ) ^g- We also define the real- 

V H-Ci,M / 

space variance of Cg i 
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Paik). 
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The Nn coefficients can be expressed in terms of the iV„ 
coefficients. 



Nn{B) 



(i + Ci.MiB)Y 

N{\B) 



(12) 



which can be verified by comparing ([7|) with ([9]) and ()10|) . 
The Nn, and hence the locally averaged n-point func- 
tions, will vary among subsamples due to variation in 
the bias B, which is itself drawn from a Gaussian dis- 
tribution with variance (B^) = ali/a^ < 1. In the hmit 
M'^ ^ fc^ax, ^ 1. 

The level of non-Gaussianity in C introduced by any 
one of the Nn coefficients can be quantified by 7V„(Tq~^. 
Using ((T0| and (|T2|) . it is easy to show that the corre- 
sponding quantity NnCQ^^ for the small volume is given 

by 



Xn{B) EE NnaJ^-^ = ^r'^n^o. 



(13) 



The increase or decrease in the level of non-Gaussianity 
is determined by the same factor of Ni^ for all terms, up 
to higher order corrections in the iV„, as expressed in ([8|). 
If we truncate the series at two terms, where N2 = ^/nl, 
we find 
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fNLf^a = Inl'^Os ( 1 + -^Inl^qB 



(14) 



Generically, if the series in the large volume L was a good 
Taylor expansion with Nn+iCc < Nn, the coefficients in 
volume M will be not too different from those in L. For 
unbiased subsamples, where the long wavelength modes 
happen to average to zero, B = and the statistics of 
the subsample are identical to those of the volume L. 

The running of the parameters of the series with the 
background bias B can also be expressed in differen- 
tial form, analogous to renormalization group equations. 
From ([8]) we have o-Q^dNi/dB = Nn+i, and from ([6]) we 
have cjQ^dCi/dB = ciQ^dNo/dB = Ni. One can show 
from (ll31) that 



dB 



Xn 



-Xo. 



(15) 



This equation is valid for any set of initial conditions 
A„(0), that is, for any set of coefficients Nn, although 
one must take care when i? = in cases where there is 
no linear term in the large volume {Ni = 0), because 
we have normalized the linear term to have a coefficient 
1 in the small volume Writing a similar differen- 

tial equation for the dimensionless (connected) moments 
Mn = (C(a;)")c/(C(a;)^)"^^ would avoid that problem 
and be more complete, but it is also more notationally 
cumbersome so we do not write it here. 

Weakly Non- Gaussian Ansatz. Let us now consider a 
case where the series in the volume L is fine tuned, with 
some coefficients Nn unusually large or small. Consider 
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first the case where the Nn with p > n > 1 are zero in 
the large volume L so that after the linear term the series 
starts only at order p: 



c = 



Cg + ^Np^G + t^^p+^Cg"' + P > 3. (16) 



p\ 



ip+1) 



By "weakly non-Gaussian" we mean that the linear term 
dominates, so jjNpV^J''^^^'^ < 1. To ensure a simple 
behavior of the highest moments, we also assume that 
the nonzero terms become smaller by the same ratio 
f - (iV„+i/7V„)^^/' « 1, and that l^Npf^J-^^^^ ^ 
~p-i^ This scenario gives a nearly Gaussian field in vol- 
ume L whose non-Gaussian moments have some unusual 
properties. For n > p the dimensionless moments Ain 
scale like Ain oc r"^^. However, the moments with 
n < p + 1 are not necessarily ordered (eg, Mn it Mn+i 
is possible). Defining A = ctq/Vg, with Af"^ <^ 1, 
the moments with n < p behave as A^„ oc r'^A^ for 
(p, n) = (odd, odd) or (even, even), and Mn oc rP^^A^^ 
for {p,n) — (even, odd) or (odd, even). 

However, in subsamples the long-wavelength modes 
will generate the missing lower order terms: 



(17) 



With the restriction that the terms with n > p in 
the large volume fall off according to f ^ 1, Nn ~ 
Np{l + C/(B))"-1(ctoB)p~"/(p - n)! for p > n > 1, and 
A''i = 1. Interestingly, the correlation functions (C") are 
not of order TV^^"^ but are instead dominated by the con- 
tribution from Nn-i- 

For sufficiently biased subsamples, the series of dimen- 
sionless moments can be written, for 2 < n < p, 



Mn CC C[rJl 



NL '^Oj 

e// _ 1+6 (B) 



(18) 



NL 



doB 



That is, the 



where C oc f^^^, and / 
level of non-Gaussianity and scaling of the moments in 
sufficiently biased subsamples is determined not by the 
original parameters Nn, but by the local background 
B. (In contrast, for n > p, Mn oc TYn^icrQ""^-*^^.) 
Because f^Jl^a — o'o/o'o-B, we have Mn+i/Mn ^ 
'hC'^^^^^^y^^ ■ Consequently, for sufficiently biased 
subsamples, the n < p moments will fall off as n in- 
creases. We will see that this tends to be the case for 
subsamples containing fewer subhorizon modes than the 
number of superhorizon background modes. Note also 
that C is still only weakly non-Gaussian. 

Strongly Non-Gaussian Ansatz. Next, consider a case 
where the statistics in the volume L are very non- 
Gaussian: 



C = + (^^.+iCg+' + ..., p>1. (19) 



where again we assume for simplicity that the first term 
in the series dominates. In this case the moments Mn 



are all of 0(1). In the smaller volume M the entire local 
ansatz series is regenerated, but with 



" {Np{doB)py^-^ {p~n)l ' 



(20) 



Now the linear term is regenerated like all the other 
terms, and the correlation functions (C") are of order 



Mn^[rJl<Jor-\ 2<n<p 



(21) 



where flf/ — -'-t^';*^'-"^^ . Although there is no longer an 

additional small factor suppressing the moments, as in 
^TE\i . the scaling of the moments is otherwise the same 
as described above (for n > p, the moments again fall 
off with the original scale f). For biased enough sub- 
samples the moments can be small and fall off rapidly; 
even a strongly non-Gaussian model in the large volume 
generates subsamples that are weakly non-Gaussian. 

An easy way to see that Gaussian statistics are recov- 
ered on small scales is to consider the simple case C = Cg- 
Breaking Cg into long and short wavelength modes, we 
have ( — (^^1^ + 2(giCgs + Cgs- the number of back- 
ground modes is much greater than the number of short- 
wavelength modes, ln(L/M) ^ ln(M/i?), then as long 
as CG;,Af ~ (Cg/)^^^' linear term will be much larger 
than the quadratic term. In general, when the scale of the 
subsamples is small enough, typical subsamples will be 
sufficiently biased to regenerate the familiar local ansatz. 
(In the case of a scale-dependent power spectrum where 
longer wavelength modes have greater power (us < 1), 
the bias B = Cgi,m/S'o from the background increases 
more rapidly as the subsample size M is decreased, caus- 
ing the linear term to be boosted in size and the field ^ 
to be more Gaussian.) 

Behavior of n-Point Function Shapes. In specifying 
C, we determine shapes for the n-point functions on all 
scales. In subsamples, these shapes are still present, but 
(as in the two examples considered here) can be dom- 
inated by soft limits from higher 7i-point functions in- 
duced by the background. One might think that ar- 
bitrarily non-linear terms in C could give arbitrary k- 
dependence to the n-point functions. Then the usual 
local-shape n-point functions could be recovered in suf- 
ficiently biased small subsamples from very different 
shapes in the large volume. In the highly non-Gaussian 
case (|19p . the n-point functions may involve many loops 
(momentum space integrals), whereas in small subsam- 
ples the lower order terms allow the dominant shape to 
come from tree diagrams. Even for the fine-tuned nearly 
Gaussian case p6)) . it is possible for n-point functions to 
be dominated by contributions with many loop integrals, 
if we remove the earlier requirement that higher order 
terms fall off by f ^ 1 . 

To address this possibility, let us consider the 
p-loop contributions to the two-point function 
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from a given higher order term in the series: 
((7Vp+iCS+')k.(iVp+iCl+^)k,) G (Ck.Ck.). This con- 
tribution can be expressed in the form 



pp-loop ^ 



1 



i=l 



n 

.i=l 



1 



1 



(22) 

Wc find that after evaluating m such integrals starting 
from the right, with a momentum cutoff L^^ for all fac- 
tors in denominators and taking the limit L^^ pi <C 
k-aiax, an additional factor of ln'"(pm+ii) appears, giving 
pp-ioop ^ k^'i\n^{kL). Additional terms are also intro- 
duced, but either have weaker momentum dependence 
or can be discarded in the limit Pi/fcmax *C 1. The ap- 
pearance of the scale L in these expressions should not 
be interpreted as measurability of L, since its value is 
completely degenerate with the amplitude of the power 
spectrum, the spectral index, and analogous quantities 
for higher order correlation functions (see, eg, [12|). 

This analysis can be generalized to the three-point and 
higher n-point functions; an n-loop contribution to the 
bispectrum will involve terms of the form [isj 

1 



1,3 1,3 



ln"i (fcii) In"^ (fczi) In"^ (min(fci, fca)^) + perms., 

(23) 

where ^to^ = n and mi. 2, 3 are the number of loops 
coming from contractions between different pairs among 
three terms in the series contributing to the bispectrum. 
In the squeezed limit, fci -> and ^2 — k^, only terms 
with mi — Q will contribute, so the squeezed limit will 
still be characterized by the usual fcf dependence. 

In conclusion, then, a local ansatz Q with arbitrar- 
ily fine-tuned coefhcients A'^^ can contribute additional 
logarithmic fc-dependence to n-point functions, but the 
behavior in the squeezed limit remains unchanged; this 
characteristic behavior of the local model cannot be 
erased by fine-tuning the coefficients. 

The question of shape is also more complex for higher 
n-point functions in that there are more tree level shapes. 
For the local model, there are two trispectrum shapes 
typically discussed: Tg = gNLPG{ki)PG{k2)PG{k3) and 
Tr = TNLPG{ki)PG{k2)PG{\^i +^3!), with sums over 
permutations; in our case t^l — (f /wl)^- For the nearly 
Gaussian ansatz (jl6p . cubic and quadratic terms will be 
regenerated, with gNh/ f^L ^ r~'^P~^^ 1, so the Tg 
shape will dominate the T-r shape in sufficiently biased 
subsamples. In the large volume this is also true; the 
leading term {(gmCgmCgm (^pC^)k4) (or p-t- 1 for even 
p) has the same momentum dependence. 

For the highly non-Gaussian ansatz ([19]), in sufficiently 
biased subsamples the quadratic and cubic terms are 
large and gNL — 0{fj^]^) (assuming p > 2), so the two 
shapes contribute equally. In the large volume this is also 
true because the loop integrals in the trispectrum can be 
contracted diagrammatically in different ways, contribut- 



ing terms that approximate both tree level shapes |l3|. 
As an exception, for p = 2 the tnl shape dominates in 
both volumes because the quadratic term is abnormally 
large compared to the cubic term. 

This generalizes to higher n-point functions as well: 
the tree-level shape(s) that are dominant throughout the 
large volume will also dominate in sufficiently biased 
small subsamples. For the shape from (Ccki x 

••■CG,k„_i(A„„iC"^^)k„) will dominate for any n-point 
function on all scales; for ()19|) . contributions to n-point 
functions from Cg~^ terms (the regenerated missing 
terms) will dominate, and the momentum dependence 
in the large volume will be similar. 

Conclusion. From these examples we conclude that 
for local non-Gaussianity a weakly non-Gaussian series, 
with terms falling off by a characteristic ratio r and the 
moments following a hierarchical scaling A4„_|_i/A^„ ~ 
r, is statistically natural: such a series is recovered in 
typical subsamples on sufficiently small scales from an 
arbitrary set of local terms in the large volume. Any 
model for local non-Gaussianity can be expressed as a 
superposition of the two specific cases considered here. 

These results suggest two important things for under- 
standing what limits on or detection of non-Gaussianity 
imply for theories of the primordial universe. First, the 
form of the local ansatz is protected against changes of 
scale: although the finiteness of the observable universe 
means a one-to-one map between observations and the- 
ory parameters may not be possible, subsampling does 
not lead to correlation functions with arbitrary shape in 
momentum space. Second, these results are independent 
of a specific dynamical origin for the fluctuations and sug- 
gest that purely statistical arguments could be used to 
define a space of most plausible non-Gaussian models to 
be tested against observations. Our results are comple- 
mentary to other statistical restrictions on the relative 



size of certain moments [14| . Extensions and applica- 



tions of this result for local, scale-dependent local and 
non-local non-Gaussianity are in progress. 
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